Entity Linking: Finding Extracted Entities in a Knowledge Base
نویسندگان
چکیده
In the menagerie of tasks for information extraction, entity linking is a new beast that has drawn a lot of attention from NLP practitioners and researchers recently. Entity Linking, also referred to as record linkage or entity resolution, involves aligning a textual mention of a named-entity to an appropriate entry in a knowledge base, which may or may not contain the entity. This has manifold applications ranging from linking patient health records to maintaining personal credit files, prevention of identity crimes, and supporting law enforcement. We discuss the key challenges present in this task and we present a high-performing system that links entities using max-margin ranking. We also summarize recent work in this area and describe several open research problems.
منابع مشابه
Generating a Large-Scale Entity Linking Dictionary from Wikipedia Link Structure and Article Text
Wikipedia has been increasingly used as a knowledge base for open-domain Named Entity Linking and Disambiguation. In this task, a dictionary with entity surface forms plays an important role in finding a set of candidate entities for the mentions in text. Existing dictionaries mostly rely on the Wikipedia link structure, like anchor texts, redirect links and disambiguation links. In this paper,...
متن کاملDomain-Specific Entity Linking via Fake Named Entity Detection
The traditional named entity detection (NED) and entity linking (EL) techniques cannot be applied to domain-specific knowledge base effectively. Most of existing techniques just take extracted named entities as the input to the following EL task without considering the interdependency between the NED and EL and how to detect the Fake Named Entities (FNEs). In this paper, we propose a novel appr...
متن کاملTAC Entity Linking by Performing Full-document Entity Extraction and Disambiguation
The paper describes the system submitted to TAC 2011 for the English entity linking task of the Knowledge Base Population track. Instead of focusing only on the provided target strings, this system extracts and disambiguates globally all entities from each target document and then maps the target string to one of the entities extracted from the document. The main features employed by the system...
متن کاملHIT Approaches to Entity Linking at TAC 2011
This paper describes the system of HIT at the 2011 Text Analysis Conference (TAC) Knowledge Base Population (KBP) track English Entity Linking task. Based on structured and unstructured information extracted from Wikipedia, this system predicts the most probable entity that a query mention might refer to. A similarity score is assigned to the candidate entity by computing the the relatedness be...
متن کاملJoint Named Entity Recognition and Disambiguation
Extracting named entities in text and linking extracted names to a given knowledge base are fundamental tasks in applications for text understanding. Existing systems typically run a named entity recognition (NER) model to extract entity names first, then run an entity linking model to link extracted names to a knowledge base. NER and linking models are usually trained separately, and the mutua...
متن کامل